Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 1022420220140040035
Phonetics and Speech Sciences
2022 Volume.14 No. 4 p.35 ~ p.43
Performance comparison on vocal cords disordered voice discrimination via machine learning methods
Jo Cheol-Woo

Wang Soo-Geun
Kwon Ick-Hwan
Abstract
This paper studies how to improve the identification rate of laryngeal disability speech data by convolutional neural network (CNN) and machine learning ensemble learning methods. In general, the number of laryngeal dysfunction speech data is small, so even if identifiers are constructed by statistical methods, the phenomenon caused by overfitting depending on the training method can lead to a decrease the identification rate when exposed to external data. In this work, we try to combine results derived from CNN models and machine learning models with various accuracy in a multi-voting manner to ensure improved classification efficiency compared to the original trained models. The Pusan National University Hospital (PNUH) dataset was used to train and validate algorithms. The dataset contains normal voice and voice data of benign and malignant tumors. In the experiment, an attempt was made to distinguish between normal and benign tumors and malignant tumors. As a result of the experiment, the random forest method was found to be the best ensemble method and showed an identification rate of 85%.
KEYWORD
diagnosis, glottic cancer, vocal cords disorder, machine learning, convolutional neural network
FullTexts / Linksout information
Listed journal information